- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources6
- Resource Type
-
0006000000000000
- More
- Availability
-
60
- Author / Contributor
- Filter by Author / Creator
-
-
Ji, Ziwei (5)
-
Telgarsky, Matus (5)
-
Srebro, Nathan (2)
-
Hu, Yuzheng (1)
-
Ji Ziwei (1)
-
Li, Justin (1)
-
Mehta, Ruta (1)
-
Srebro Nathan (1)
-
Telgarsky Matus (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
& Abramson, C. I. (0)
-
& Abreu-Ramos, E. D. (0)
-
& Adams, S.G. (0)
-
& Ahmed, K. (0)
-
& Ahmed, Khadija. (0)
-
& Aina, D.K. Jr. (0)
-
& Akcil-Okan, O. (0)
-
& Akuom, D. (0)
-
- Filter by Editor
-
-
null (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Ji, Ziwei; Srebro, Nathan; Telgarsky, Matus (, Proceedings of Machine Learning Research)null (Ed.)We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (eg, the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of O (1/t^ 2). This contrasts with a rate of O (1/log (t)) for standard gradient descent, and O (1/t) for normalized gradient descent. The momentum-based method is derived via the convex dual of the maximum-margin problem, and specifically by applying Nesterov acceleration to this dual, which manages to result in a simple and intuitive method in the primal. This dual view can also be used to derive a stochastic variant, which performs adaptive non-uniform sampling via the dual variables.more » « less
-
Ji Ziwei; Srebro Nathan; Telgarsky Matus (, International Conference on Machine Learning)
-
Ji, Ziwei; Li, Justin; Telgarsky, Matus (, Advances in neural information processing systems)
-
Ji, Ziwei; Srebro, Nathan; Telgarsky, Matus (, international conference on machine learning)
-
Ji, Ziwei; Mehta, Ruta; Telgarsky, Matus (, International Conference on Web and Internet Economics)
An official website of the United States government

Full Text Available